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Abstract 

The problem of computing the smallest fixed point of a monotone 
map arises classically in the study of zero-sum repeated games. It 
also arises in static analysis of programs by abstract interpretation. 
In this context, the discount rate may be negative. We characterize 
the minimality of a fixed point in terms of the nonlinear spectral radius 
of a certain semidifferential. We apply this characterization to design 
a policy iteration algorithm, which applies to the case of finite state 
and action spaces. The algorithm returns a locally minimal fixed 
point, which turns out to be globally minimal when the discount rate 
is nonnegative. 
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1 Introduction 



Zero-sum repeated games can be studied classically by means of dynamic 
programming or Shapley operators. When the state space is finite, such an 
operator is a map / from R d to R d , where d is the number of states. Typically, 
the operator / can be written as: 

ti(x) = mm max F- x + r,- 

aeAibEB i>a 1 1 

Here, A, represents the set of actions of Player I (Minimizer) in state i, B i>a 
represents the set of actions of Player II (Maximizer) in state i, when Player 
I has just played o (the information of both players is perfect), r"' b is an 
instantaneous payment from Player I to Player II, and P"' b = (Pfflj is a 
substochastic vector, giving the transition probabilities to the next state, 
as a function of the current state and of the actions of both players. The 
difference 1 — J2j ^if gives the probability that the game terminates as a 
function of the current state and actions. The operator / will send R d to M. d 
if for instance the instantaneous payments are bounded. We may consider 
the game in which the total payment is the expectation of the sum of the 
instantaneous payments of Player I to Player II, up to the time at which 
the game terminates. This includes the discounted case, in which for all 
i, J2jP?j b = ct < 1, for some discount factor a. Then, the fixed point of 
/ is unique, and its ith-coordinate gives the value of the game when the 
initial state is i, see [FV97]. In more general situations [MS97], the value 
is known to be the largest (or dually, the smallest) fixed point of certain 
Shapley operators, and it is of interest to compute this value, a difficulty 
being that Shapley operators may have several fixed points. 

The same problem appears in a totally different context. Static analysis 
of programs by abstract interpretation [CC77] is a technique to compute 
automatically invariants of programs, in order to prove them correct. The 
fixed point operators arising in static analysis include the Shapley operators 
of stochastic games as special cases. However, the "discount factor" may 
be larger than one, which is somehow unfamiliar from the game theoretic 
point of view. In this context, the existence of the smallest fixed point is 
guaranteed by Tarski-type fixed point arguments, and this fixed point is 
generally obtained by a monotone iteration (also called Kleene iteration) of 
the operator /. This method is often slow. Some accelerations based on 
"widening" and "narrowing" [CC92] are commonly used, which may lead to 
a loss of precision, since they only yield an upper bound of the minimal 
fixed point. Some of the authors introduced alternative algorithms based 
on policy iteration instead [CGG + 05, GGTZ07], which are often faster and 
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more accurate. However, the fixed point that is returned is not always the 
smallest one. 

In the present paper, we refine these policy iteration algorithms in order 
to reach the smallest fixed point of / even in degenerate situations. Our 
work is partly inspired by [AGN], where the uniqueness (rather than the 
minimality) problem for the fixed point of a nonexpansive maps is studied 
using semidifferentiability techniques. 

We note that an alternative approach to compute the smallest fixed point 
has been developed by Gawlitza and Seidl [GS07]. However, this reference 
concerns the integer fixed points of integer preserving maps, whereas we work 
here with real fixed points. 

2 Basic notions 

In this paper, we will work in M. d equipped with the sup-norm || • ||. We 
consider the natural partial order on M. d defined as: x < y if for all i, 1 < 
i < d, Xi < Hi where Xi indicates the ith. coordinate of x. We write x < y 
when x < y and there exists a j such that Xj < yy We denote by IR + (resp. 
R_) the set of real nonnegative (resp. nonpositive) numbers. All vectors of 
R d such that f(x) = x are called fixed points of the map /. We denote by 
Fix(/) the set of fixed points of /. 

When the action spaces are finite, dynamic programming operators are 
not differentiable, and they may even have empty subdifferentials or su- 
per differentials. However, their local behavior can be analyzed by means of 
a non-linear analogue of the differential, the semidifferential. In order to 
define it, let us recall some basic notions concerning cones. 

Definition 2.1. A subset C ofM. d is called a cone if for all X > and for 
all x G C, Xx G C. A cone C is said to be pointed if CH—C = {0}. 

Definition 2.2. Let C be cone and g be a self-map on C. The function 
g is said to be homogeneous (of degree one), if for all strictly positive real 
numbers X, g(Xx) = Xg(x). 

Closed convex pointed cones are precisely what we need to define spectral 
radii of homogeneous continuous self- maps g on C. 

Definition 2.3 (Spectral radius). Let C be a closed convex pointed cone. 
Let g be a homogeneous continuous self-map on C. We define the spectral 
radius pc{g) 1° be the nonnegative number: 

p c {g) = sup{A > | 3 x G O\{0},g(x) = Xx} 
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A vector x G C\{0} such that g(x) = Ax is a non-linear eigenvector of 
g, and A is the associated non-linear eigenvalue. The existence of non-linear 
eigenvectors is guaranteed by standard fixed point arguments [Nus88]. 

We next recall the notion of semidifferential, see [RW98] for more back- 
ground. 

Definition 2.4 (Semidifferential). Let u G M. d and f be a self-map on M. d . We 
say that f is semidifferentiable at u if there exists a homogeneous continuous 
map g on M. d and a neighborhood V of such that for all h G V: 

f(u + h)=f(u)+g(h)+o(\\h\\) 
We call g the semidifferential of f at u and we note it f' u . 

If / is semidifferentiable at u, we have for alH > and for h in a small 
enough neighborhood of 0: 

f(u + h) = f(u)+tfl(h)+ (t\\h\\) 

This implies 

m = lim+ /(»+ "0-/00 

and the semidifferential coincides with the directional derivative of / at u in 
direction h (on the positive side). The following result shows that semidif- 
ferentiability requires the latter limit to be uniform in the direction h. 

Proposition 2.1 (see [RW98, Theorem 7.21]). Let f be a self-map on M rf . 
Let u be in R d . f' u , the semidifferential of f at u exists if and only if for all 
vectors h, the following limit exists: 

lim f(u + th>) - f(u) 

t^o+ t 
h'^h 

3 Main Results 

Definition 3.1. Let u G Fix(/). We say that u is a locally minimal fixed 
point if there is a neighborhood V of u such that for all v G V H Fix(/) ; 
V < U V = u. 

Now, for a semidifferentiable map / at u, we denote Fix| M d (f' u ) the set 
of the negative fixed points of f u . 

Theorem 1. Let u G Fix(/). Consider the following statements: 
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1. u is a locally minimal fixed point. 

2. Fix |Rd J^) = {0}. 

3. p Ml (/;)<l. 

Then 3 =>- 2 =>- 1. 

Proof. Point 3 implies point 2 indeed. In order to show that 2 implies 1, 
assume that it is not a locally minimal fixed point. Then, there exists a 
sequence h n of non-zero vectors in Ml tending to the zero vector such that 
u + h n is a fixed point of /. After replacing h n by a subsequence, we may 
assume that y n := \\h n \\~ 1 h n has a limit y. Then, ||y|| = 1 and y G M^. 
Writing u+h n = u+\\h n \\y n , and using Prop. 2.1, we conclude that y = fu(y), 
showing that f' u has a non-zero fixed point in Ml. □ 

In the previous theorem, there are no restrictive conditions on the map, 
but we only get a sufficient condition for the local minimality of a fixed point. 
We next consider the special case of piecewise affine maps. 

Definition 3.2. Let f be a map on M, d . We say that f is piecewise affine 
if for all j G {1, • • • ,d} there exists finite sets Aj, {B a } ae Aj an d a family 
{9a,b}(a,b)eA,xB a of affine maps such that 

fj = minmaxtfeft 

aeAj b£B a 

It is shown in [Ovc02] that the set of piecewise affine maps that we define 
is the same as the set of functions / for which there exists a family of con- 
vex closed sets with non empty interior which covers M. d and such that the 
restriction of / on each element of this family is affine. 

Proposition 3.1. Let f be a piecewise affine self-map on ~R d . Then f is 
semidifferentiable at all u G M d . Moreover for all u G ~R d : 

1. Let Aj = {a G Aj \ fj(u) = max beBa g a ,b(u)} and B a = {b G B a \ 
9a,b( u ) = max beB a g a ,b( u )} > then 

(fl)j = min max V# a ,f 

aeAj beB a 

2. There is neighborhood V of u such that, for all u + h G V , f{u + h) — 

The first assertion may be deduced by applying the rule of the "differen- 
tiation" of a max see Exercise 10.27 of [RW98]. The following proof that we 
provide for the convenience of the reader, is a variation of this argument. 



5 



Proof. We set, for all a G Aj, g a (x) = maxb e £ a g a ^{x). By definition of B a , 
there exists a neighborhood V a of u such that: g a (u+h) = max bg ^ a g a ^{u+h), 
for all /i such that u + h E V a . Since is affine, we have: g a ,b{u + h) — 
ga,b{ u ) + Vg aj fc • /i. It follows that, for all b G B a : 

g a (u + h) = g a - b (u) + max Vg a , b -h ( 1 ) 

By definition of Aj, there exists a neighborhood of u, V C fla^' such 
that: + /i) = min a6 ^ g a (i( + /i) if u + /i G V. Applying (1), we get: 

fj(u + h) = fj(u) + min a6 ^ max be ^ a Vg a> b • h if u + h G V, which shows the 
two assertions. □ 

Corollary 3.1. Let f be a piecewise affine map on M. d and let u G Fix(/) ; 
then u is a locally minimal fixed point if and only Fix| R d (/£) = {0}. 

The second part of the previous proposition is the basis of a "descent" 
algorithm given in the section 4. If a fixed point u is not locally minimal, 
then there exists a strictly negative fixed point h for f' u which may be thought 
of as a descent direction such that u + h is a fixed point of /. 

In order to pass from local minimality to global minimality, we shall need 
the following nonexpansiveness condition. 

Definition 3.3. Let f be a map on M. d : f is nonexpansive (with respect to 
the sup-norm) if for all x, y G M. d , \\f(x) — f(y) \\ < \\x — y\\. 

Proposition 3.2. Let f be a nonexpansive map on M d and let u G M d , then 
FiX| Rl (f' u ) = {0} if and only if p^<i_{f' u ) < 1. 

Proof. It suffices to show that p R d (f' u ) < 1. Firstly, f' u is nonexpansive 
because it is a pointwise limit of nonexpansive mappings. Assume by con- 
tradiction that p W d (f' u ) > 1, so there exists /i > 1 and v G Ml such that 
fu(v) = /if, then \\fu(v) — /^(0)|| = p\\v\\ > \\v — 0\\ which contradicts the 
nonexpansiveness of /„. □ 

The main result of this paper is the following theorem, which will allow 
us to check the global minimality of a fixed point. 

Theorem 2. Let f be a monotone nonexpansive self-map on M, d . Let u be a 
fixed point of f . Then, u is locally minimal if and only if it is the smallest 
fixed point of f . If in addition f is piecewise affine, the following assertions 
are equivalent: 

1. u is the smallest fixed point of f. 
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2. Fix |Kd _(/^) = {0}. 



3. 1. 

The proof of this theorem relies on the existence of a monotone and 
nonexpansive retract on the fixed point set of /. This idea was already used 
in [CGG + 05]. The existence of nonexpansive retracts on the fixed point set 
is a classical topic in the theory of nonexpansive mappings, see Nussbaum 
[Nus88]. In the present finite dimensional case, the result of the next lemma 
is an elementary one. 

Lemma 3.1. Let f be a nonexpansive monotone map on M, d . Let u be in 
Fix(/). We suppose Fix(/) ^ 0. Then there is a nonlinear monotone and 
nonexpansive map P such that P(R. d ) = Fix(/) and Fix(P) = Fix(/). 

Proof. Following the idea of [GG04], we shall construct the map P as follow: 
P(x) = limfc^ +00 f k (y) where y = limsup^ +00 f l (x). Since / is nonexpan- 
sive and u G Fix(/), {f k {x)}k>o is bounded for all x G M. d . We can now 
write, for all x G M d , Q(x) = limsup fc _ >+00 f k {x). Moreover, given k > 0, we 
have, for all m > k, sup ra>fc f n (x) > f m (x) and since / is monotone, for all 
m > k, /(sup n > fe r(x)) >f(f m (x)) so, /(sup n > fc r(a;)) > sup m > fe f(f m (x)), 
we conclude, by taking the limit when k tends to +oo and using the con- 
tinuity of /, f(Q(x)) > Q(x), so {f\Q(x)}k>o is an increasing sequence. 
Moreover, since / is nonexpansive, the limit P(x) = lim^ +00 f l (Q(x)) is 
finite. Observe that the map P is monotone and nonexpansive since it 
is the pointwise limit of monotone and nonexpansive maps. Furthermore, 
f(P(x)) = f(]imf(Q(x)) = \im f l+1 Q(x) = P(x) so P(R d ) C Fix(/). 
Moreover, it is easy to see that P fixes every fixed point of /. It follows 
that P is a projector. □ 

Proof of Theorem 2. Suppose that u is a locally minimal fixed point but not 
the smallest fixed point. Then, there is fixed point v such that inf(i>, u) < u. 
For all scalars t > 0, define u>t :— inf (v+t, u). Let us take P as in Lemma 3.1. 
Since P is nonexpansive for the sup-norm, \\P(v + t) — P(v) \\ < t for all t > 
and so P(v + t) < P(v) + t. Using the monotonicity of P, we deduce that 
P(u t ) < mi(P(v + t),P(u)) < M(P(v) + t,P{u)) = u t . Let t = inf{t > 
| u t = u}. Then, for < t < t , P(oJt) is a fixed point of /, which is such 
that P(u) t ) < u. Since P is continuous, P((v t ) tends to P(uj t() ) = P(u) = u 
as t tends to t$ , which contradicts the local minimality of u. Hence, u is the 
smallest fixed point. Define by 1' the property that u is a locally minimal 
fixed point. We just showed 1 1'. By Theorem 1, we get 3 =^ 2 =^ 1'. 
By Corollary 3.1, V =>• 2 and by Prop 3.2 we get 2^3. □ 
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4 A policy iteration algorithm to compute 
the smallest fixed point 

The previous results justify the following policy iteration algorithm which 
returns the smallest fixed point of a nonexpansive monotone piecewise afline 
map. Assume that / is a map from M. d to M. d , every coordinate of which is 
given by 



where Aj is finite and every f a is a supremum of monotone nonexpansive 
affine maps. A strategy 7r is a map from {1, • • • , d} to A — Ui<j<d^i sucn 
that e Ay We define f n = • • • , f n{j) , ■■■ , f n(d) ). The algorithm, 

needs two oracles. Oracle 1 returns the smallest fixed point in M. d of a map 
f w . Oracle 2 checks whether the restriction of f' u to the convex cone Ml has a 
spectral radius equal to 1, and if this is the case, returns a vector h G lR d \{0} 
such that fu(h) = h. We discuss below the implementation of these oracles 
for subclasses of maps. 

Algorithm 1 Computing the smallest fixed point by Policy Iteration 



INPUT: f in the form (2). OUTPUT: The smallest fixed point of / in R d . 
INIT: Select a strategy tt°, k = 0. 

VALUE DETERMINATION (D k ): Call Oracle 1 to compute the small- 



POLICY IMPROVEMENT (l£): If f(u k ) < u k define vr fc+1 such that 
f{u k ) = r k+ \u k ) and go to Step (D k+1 ). 

POLICY IMPROVEMENT (l£): If f{u k ) = u k , call Oracle 2 to com- 
pute ttfc := p(f'k)- If Oik < 1, returns u h , which is the smallest fixed point. 
If a k = 1, take\ € M d \{0} such that /£ fc (ft) = /i. Define 7r fc+1 (j) to be any 
action a which attains the minimum in (/' k )j(h) = min ae ^. (f^) u k(h) where 
Aj = {a | f a {u k ) = f{u k )}. Then go to (iC+i)- 

To show that Algorithm 1 terminates, it suffices to check that the se- 
quence u ,^ 1 ,--- is strictly decreasing, because the corresponding policies 
must be distinct and the number of policies is finite. If an improvement of 
type 1\ arises, then, the proof of the Theorem 3 from [GGTZ07] shows that 
u k+i < u k _ jf an i m provement of type i| arises, then by Prop 3.1 asser- 
tion 2, f Tk+L (u k + th) = u k + th for t > small enough. It follows that 

u k+l < u kj r tf l< u k 

Now, we discuss the implementation of the oracles. The following propo- 
sition identifies a situation where Oracle 1 can be implemented by solving a 
linear programming problem. 




(2) 



est fixed point u 



k of p\ 
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Proposition 4.1. Let g be a monotone nonexpansive map that is the supre- 
mum of finitely many affine maps. Assume, furthermore, that g has a small- 
est fixed point in M. d . Then this fixed point is the unique optimal solution of 
the linear program: min{^ 1<i<d xt \ x G WL d ,g(x) < x}. 

Proof. Let x denote the smallest fixed point. The vector x is clearly a fea- 
sible point of the previous linear program. If x is an another feasible point, 
consider y = lim^ g k (x). Since g is monotone, nonexpansive and has a fixed 
point, g k (x) is a bounded decreasing sequence so y < x and y is a finite fixed 
point of g so y > x. We get ^2 i x i < J2iVi ^ J2i x i- Since this holds for 
all feasible x, it follows that x is an optimal solution. If x is an arbitrary- 
optimal solution, we must have ^ X{ = J2i x i an d we deduce from y < x 
and y > x that x < x. It follows that x — x. □ 

The implementation of Oracle 2 raises the issue of computing the spectral 
radius. Let g be a monotone, homogeneous and continuous self-map of Wt. 
It is known that: 

(g) = inf j sup ^ (3) 



ieint(Ri) l<i<d Xi 



l 



jd (g) = sup limsup ||^ (x)|| fc (4) 



The first equality, which is a generalization of the Collatz-Wielandt prop- 
erty in Perron- Frobenius theory, follows from a result of Nussbaum [Nus86] 
Theorem 3.1. The second characterization is shown by Mallet-Paret and 
Nussbaum in [MPN02] under more general conditions. We deduce, for every 
vector x G int(Rl): 

p Rd _(g)< (sup fcMY. (5) 



K l<i<d Xi 

Moreover, the latter upper bound converges to p R d (g) as k tends to infinity. 
This yields an obvious method to check whether p R d (g) < 1, which consists 
in computing the upper bound in (5) for successive values of k as long as 
the upper bound is not smaller than 1. This algorithm will not stop when 
p R d (g) = 1. However, we next describe a simple situation where this idea 
leads to a terminating algorithm. 

Definition 4.1. We call a homogeneous min-max function of the variables 
h±, . . . , ha a term in the grammar: X \— > min(X, X), max(X, X), hi, • • • ,hd,0. 

For instance, the term min(/i 1 , max(/i 2 , h 3 , 0)) is produced by this gram- 
mar. More generally, we shall say that a map from R d to R d is a homogeneous 
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min-max function if its coordinates are the form of Def 4.1. This definition 
is inspired by the min-max functions considered by Gunawardena [Gun94] 
and Olsder [01s91]. The terms of this form comprise the semidifferentials of 
the min-max functions considered there. For simple classes of programs, like 
the one we shall consider in the next section, the semidifferential at any fixed 
point turns out to be a homogeneous min-max function. In this case, the 
spectral radius can be computed efficiently by using to the following integrity 
argument g(Z d ) C Z d . 

Proposition 4.2. Let g be a homogeneous min-max function on ~R d . Then 
Pm d (fl 1 ) £ {0) !}■ Moreover p R d (g) = iff limfc_ >+00 g k (e) = 0, where e is the 
vector all the entries of which are equal to -1, and the latter limit is reached 
in at most d steps. 

Proof. Since g is nonexpansive and g(0) = 0, we have g(e) > e. We deduce, 
by monotonicity of g, that {g k {e)}k>o is a nondecreasing sequence bounded 
from above by 0. Moreover, g preserves the set of integer vectors. So this 
sequence converges in at most d steps to some vector b e Z d _. Let us suppose 
that b — 0. For all y G Rl, there exists t > such that, > y > te. Since g 
is homogeneous and monotone, > g k (y) > tg k (e) = for all k > d which 
implies that p R d (g) = 0. If b < we have g(b) = b 7^ 0, and so p R d (g) > 1. 
Finally, since g is nonexpansive, we also have p R d (g) < 1. □ 

5 Example 

We next illustrate our results on an example from static analysis. We take 
a simple but interesting program with nested loops (Figure 1). From this 
program, we create semantic equations on the lattice of intervals [CC77] 
(Figure 2) that describe the outer approximations of the sets of values that 
program variables can take, for all possible executions. For instance, at 
control point 4, the value of variable y can come either from point 3 or point 
5 (hence the union operator), as long as the condition y > 5 is satisfied (hence 
the intersection operator). An interval / is written as [— i~,i + ] in order to 
get fixed point equations of monotone maps in i l and i + . The equations we 
derive on bounds are monotone piecewise affine maps, to which we can apply 
our methods. 

The monotone nonexpansive piecewise affine map / for the bounds of 
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int x , int y , 

x=[0,2];y=[10,15] //l 

while (x<=y) { //2 

x=x+l; //3 

while (5<=y) { //4 

y=y-i; ' //5 

> //6 

> //7 

Figure 1: A simple C program 



(x-i.y-t) = 


([0,21, 110,151) 

VL ' J' L ' J / 


x 2 = 


(xi U x 6 ) n [-00, (yi U y 6 ) + ] 


2/2 = 


(j/iUjfe) n [(xiUx 6 )",+oo] 


(ar3,3/3) = 


(x 2 + [l,l],y 2 ) 


(£4,2/4) = 


(a?3, (2/3 U 1/5) n [5, +00]) 


(£5,2/5) = 


(x 4 ,y 4 + [-1, -1]) 


(^6,2/e) = 


(x 5 ,(y 3 Uy 5 ) n [-oo,4]) 


x 7 = 


(x 1 Ux 6 )n[(yiUy 6 )- + l,+oo] 


2/7 = 


(j/i U y 6 ) n [-00, (a;! U x 6 )+ - 1] 



Figure 2: Its abstract semantic 
equations in intervals 



these intervals is: 

V (xj - 1) \ 
A 15 V y£ 
A (-10Vy 6 -)-l 

v (4 + 1) 

A -10 V y 6 - 

V 2/ 6 + 
A ^5 

v yt - 1 
v 2/4" + 1 

A 4 

v y 6 

A (2 V (4 + !))-! / 

In the equations for the intervals x 2 , 2/2, 2/4, 2/6, x 7 an d 2/7, an intersection 
appears, which gives a min (A) in the corresponding coordinate of /. Choos- 
ing a policy is the same as replacing every minimum of terms by one of the 
terms, which yields a simpler "minimum-free" non-linear map, which can be 
interpreted as the dynamic programming operator of a one-player problem. 

We next illustrate Algorithm 1. The underlined terms in the expression 
of / indicate the initial policy 7r°, for instance, the fifth coordinate of f w is 
V (#2 — !)• We first compute the smallest fixed point of this policy (Step 
(Do)). This may be done by linear programming (Prop. 4.1), or, in this 
special case, by a reduction to a shortest path problem. We find (x,y) = 



f 



f 



l X 2 \ 








X >j 








2/2" 




vt 




2/4 




: yt 




2/ 6 




; yt 




y 7 




\yt J 







2 V (4 + 1) 
V (xj - 1) 


V (X2 



1) 



15 

2/ 2 " V (yl + 1) 

yt 
2/2 

4 V (y+ - 1) 
-10 
15 V y+ 
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(0, 15, -1, 16, 0, 15, -5, 15, 0, 4, 0, 15) T . The first Policy Improvement step, 
(Jg), requires to check whether this is a fixed point of /. This turns out 
to be the case. To determine whether (x, y) is actually the smallest fixed 
point, we enter in the second policy improvement step, (Jq). We calculate 
the semidifferential at (x, y) in the direction (Sx,5y), using the first point of 
Prop 3.1: 

f[ m (Sx, 5y) J = (0, 0, Sy 6 , 5x+, A 5y 6 , 0, 0, 8y£, Sy 2 , 0, Sy 6 , A 5x+) J 

The method of Prop 4.2, which we use as Oracle 2, yields in three steps a fixed 
point for f, . that we denote by h = (0, 0, -1,0, -1, 0,0, 0, -1,0, -1, 0) T . 
This fixed point determines the new policy it 1 , which corresponds to the map 
9 ■= f wl given by: 



/ x 2 






( 


V 


x 2 1 


x 2 






15 
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We are now in a new value determination step, (-Di). We find the fixed 
point (u, v) = (0, 15, —5, 16, —4, 15, —5, 15, —4, 4, —4, 15) T of g, which is also 
a fixed point of / (Step (II)). In Step (If), we compute the semidifferential 
of / at (u, v), which is given by: 

f(u,v) ^) T = (°> 0. % . S ^t , H~> °> °> ^2 + > ^2 V 5v±,0, 8vq, A 5u+) J 

Calling again Oracle 2, we get pM. d (f(uv)) = 0' anc ^ so ; ^ ne algorithm 
stops: (u, v) is the smallest fixed point of /. 
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